yzBigData: Provisioning Customizable Solution for Big Data
نویسندگان
چکیده
YZStack is our developing solution which implements many wellestablished big data techniques as selectable modules and allows users to customize their systems as a process of module selection. In particular, it includes an openstack based IaaS (Infrastructure as a Service) layer, a distributed file system based DaaS (Data as a Service) layer, a PaaS (Platform as a Service) layer equipped with parallel processing techniques and a SaaS (Software as a Service) layer with popular data analytic algorithms. Layers of YZStack are loosely connected, so that customization of one layer does not affect the other layers and their interactions. In this paper, we use a smart financial system developed for the Zhejiang Provincial Department of Finance to demonstrate how to leverage YZStack to speed up the implementation of big data system. We also introduce two popular applications of the financial system, economic prediction and detection of improper payment.
منابع مشابه
When to use 3D Die-Stacked Memory for Bandwidth-Constrained Big Data Workloads
Response time requirements for big data processing systems are shrinking. To meet this strict response time requirement, many big data systems store all or most of their data in main memory to reduce the access latency. Main memory capacities have grown, and systems with 2 TB of main memory capacity available today. However, the rate at which processors can access this data—the memory bandwidth...
متن کاملOptical networks for cost-efficient and scalable provisioning of big data traffic
This article shows how recent advances in optical networks can be utilized to improve big data processing by cost effective and scalable provisioning of high-bandwidth connectivity for big data traffic in backbone networks and consequently tackle the current problems related to big data processing in distributed environment including cloud computing. We focus on two optical technologies, namely...
متن کاملCSPE: Cloud Storage Provisioning Decided by Rate of Return and Workload Characteristics
As recent report [1] claims, the capacity of digital content on the Internet has amounted to 500 billion GB. What is more, this number is estimated to be double in next year. The emerging of cloud computing offers a rather feasible solution to the problem of information explosion. Thus, for those IT enterprises with high demand of storage, a big concern is to determine whether it is cost effect...
متن کاملCloud Template, a Big Data Solution
Today cloud computing has become as a new concept for hosting and delivering different services over the Internet for big data solutions. Cloud computing is attractive to different business owners of both small and enterprise as it eliminates the requirement for users to plan ahead for provisioning, and allows enterprises to start from the small and increase resources only when there is a rise ...
متن کاملDigital still cameras and mobile agents: How to create a distributed service for image processing
The new distributed multimedia applications require more and more to manage user’s mobility. The opportunity of accessing the data at any time, from any place, and with terminals having several processing capabilities, is one of the most important features required. Adequate mechanisms need therefore to be developed, in order to manage the user’s mobility and the distributed processing of data ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- PVLDB
دوره 7 شماره
صفحات -
تاریخ انتشار 2014